Competing Risks Data Analysis with High-dimensional Covariates: An Application in Bladder Cancer
Abstract
Analysis of microarray data faces the methodological problems of high dimensionality and small sample size. Various methods have been used for variable selection in high-dimensional, small-sample settings with a single survival endpoint. However, little effort has been directed toward competing risks, where there is more than one type of failure. This study compared three typical variable selection techniques, namely Lasso, elastic net, and likelihood-based boosting, for high-dimensional time-to-event data with competing risks. The performance of these methods was evaluated via a simulation study and by analyzing a real dataset of bladder cancer patients, using time-dependent receiver operating characteristic (ROC) curves and bootstrap .632+ prediction error curves. The elastic net penalization method was shown to outperform Lasso and boosting. The elastic net selected 33 of the 1381 genes related to bladder cancer. When these were fitted in the Fine and Gray model, eight genes were highly significant (P<0.001). Among them, expression of RTN4, SON, IGF1R, SNRPE, PTGR1, PLEK, and ETFDH was associated with a decrease in survival time, whereas SMARCAD1 expression was associated with an increase in survival time. This study indicates that the elastic net has a higher capacity than the Lasso and boosting for predicting survival time in bladder cancer patients. Moreover, the genes selected by each method improved predictive power over the model based on clinical variables alone, indicating the value of the information contained in the microarray features.
Related articles
Boosting for high-dimensional time-to-event data with competing risks
MOTIVATION For analyzing high-dimensional time-to-event data with competing risks, tailored modeling techniques are required that consider the event of interest and the competing events at the same time, while also dealing with censoring. For low-dimensional settings, proportional hazards models for the subdistribution hazard have been proposed, but an adaptation for high-dimensional settings i...
Comparison of Random Survival Forests for Competing Risks and Regression Models in Determining Mortality Risk Factors in Breast Cancer Patients in Mahdieh Center, Hamedan, Iran
Introduction: Breast cancer is one of the most common cancers among women worldwide. Patients with cancer may die due to disease progression or other types of events. These different event types are called competing risks. This study aimed to determine the factors affecting the survival of patients with breast cancer using three different approaches: cause-specific hazards regression, subdistri...
Survival of Gastric Cancer Patients after Surgery: An Analysis Based on Competing Risks
Background and Aim: Many researchers have studied survival (time to death) of gastric cancer patients. Although gastric cancer diagnosed in early stages can be cured by surgery, chance of relapse still exists after operation. Hence, we should consider both events, that is, relapse of the disease and death, in order to be able to make a more precise estimation for survival of the patients. The p...
Factors Affecting the Risk of Death in Patients with Rectal Cancer: An Analysis in the Presence of Competitive Risks
Background and Objectives: The incidence of rectal cancer is increasing in developing societies, especially in younger age groups. The aim of this study was to evaluate the factors affecting the survival of patients with rectal cancer in the presence of competing risks. Methods: In this retrospective cohort study, the data of 121 patients with rectal cancer during 2001-2017 were studied. De...
Missing covariates in competing risks analysis
Studies often follow individuals until they fail from one of a number of competing failure types. One approach to analyzing such competing risks data involves modeling the cause-specific hazards as functions of baseline covariates. A common issue that arises in this context is missing values in covariates. In this setting, we first establish conditions under which complete case analysis (CCA) i...